智能论文笔记

Evaluating robustness of You Only Hear Once(YOHO) Algorithm on noisy audios in the VOICe Dataset

Soham Tiwari , Kshitiz Lakhotia , Manjunath Mulimani

分类：自然语言处理

2021-11-01

机器侦听中的声音事件检测（SED）需要识别音频文件中的不同声音，并识别音频中特定声音事件的开始和结束时间。 SED在多媒体数据库中发现了在各种应用中的应用，例如音频监控，语音识别和基于上下文的索引和检索数据。然而，在现实生活场景中，来自各种来源的声音很少没有任何干扰噪音或干扰。在本文中，我们在嘈杂的音频数据上测试您只听到一次（Yoho）算法的性能。由您的灵感仅仅看一次（YOLO）算法在计算机视觉中，yoho算法可以匹配数据集上各种最先进的算法的性能，如音乐语音检测数据集，TUT声音事件和城市 - SED数据集，但在较低的推理时间。在本文中，我们探讨了Yoho算法在包含不同声音噪声比（SNR）的噪声的音频文件的语音数据集的性能。 Yoho可以胜过或至少匹配语音数据集纸中的最佳性能SED算法，并在更短的时间内进行推断。

translated by 谷歌翻译

3D Neuron Morphology Analysis

Jiaxiang Jiang , Michael Goebel , Cezar Borba , William Smith , B. S. Manjunath

分类：计算机视觉

2022-12-14

We consider the problem of finding an accurate representation of neuron shapes, extracting sub-cellular features, and classifying neurons based on neuron shapes. In neuroscience research, the skeleton representation is often used as a compact and abstract representation of neuron shapes. However, existing methods are limited to getting and analyzing "curve" skeletons which can only be applied for tubular shapes. This paper presents a 3D neuron morphology analysis method for more general and complex neuron shapes. First, we introduce the concept of skeleton mesh to represent general neuron shapes and propose a novel method for computing mesh representations from 3D surface point clouds. A skeleton graph is then obtained from skeleton mesh and is used to extract sub-cellular features. Finally, an unsupervised learning method is used to embed the skeleton graph for neuron classification. Extensive experiment results are provided and demonstrate the robustness of our method to analyze neuron morphology.

translated by 谷歌翻译

RT-1: Robotics Transformer for Real-World Control at Scale

Anthony Brohan , Noah Brown , Justice Carbajal , Yevgen Chebotar , Joseph Dabis , Chelsea Finn , Keerthana Gopalakrishnan , Karol Hausman , Alex Herzog , Jasmine Hsu

分类：机器人 | 人工智能 | 自然语言处理 | 计算机视觉 | 机器学习

2022-12-13

By transferring knowledge from large, diverse, task-agnostic datasets, modern machine learning models can solve specific downstream tasks either zero-shot or with small task-specific datasets to a high level of performance. While this capability has been demonstrated in other fields such as computer vision, natural language processing or speech recognition, it remains to be shown in robotics, where the generalization capabilities of the models are particularly critical due to the difficulty of collecting real-world robotic data. We argue that one of the keys to the success of such general robotic models lies with open-ended task-agnostic training, combined with high-capacity architectures that can absorb all of the diverse, robotic data. In this paper, we present a model class, dubbed Robotics Transformer, that exhibits promising scalable model properties. We verify our conclusions in a study of different model classes and their ability to generalize as a function of the data size, model size, and data diversity based on a large-scale data collection on real robots performing real-world tasks. The project's website and videos can be found at robotics-transformer.github.io

translated by 谷歌翻译

MalGrid: Visualization Of Binary Features In Large Malware Corpora

Tajuddin Manhar Mohammed , Lakshmanan Nataraj , Satish Chikkagoudar , Shivkumar Chandrasekaran , B. S. Manjunath

分类：计算机视觉 | 机器学习

2022-11-04

The number of malware is constantly on the rise. Though most new malware are modifications of existing ones, their sheer number is quite overwhelming. In this paper, we present a novel system to visualize and map millions of malware to points in a 2-dimensional (2D) spatial grid. This enables visualizing relationships within large malware datasets that can be used to develop triage solutions to screen different malware rapidly and provide situational awareness. Our approach links two visualizations within an interactive display. Our first view is a spatial point-based visualization of similarity among the samples based on a reduced dimensional projection of binary feature representations of malware. Our second spatial grid-based view provides a better insight into similarities and differences between selected malware samples in terms of the binary-based visual representations they share. We also provide a case study where the effect of packing on the malware data is correlated with the complexity of the packing algorithm.

translated by 谷歌翻译

Transport in reservoir computing

G Manjunath , Juan-Pablo Ortega

分类：神经与进化计算

2022-09-16

储层计算系统是使用驱动的动力系统构建的，在该系统中，外部输入可以改变系统的发展状态。这些范例用于信息处理，机器学习和计算。在此框架中需要解决的一个基本问题是输入与系统状态之间的统计关系。本文提供的条件可以保证驱动系统的渐近措施的存在和唯一性，并表明当输入和输出过程的集合赋予了Wasserstein距离时，它们对输入过程的依赖性是连续的。这些发展中的主要工具是将这些不变的度量表征为在这种情况下出现并在论文中进行了大量研究的自然定义的FOIA算子的固定点。这些固定点是通过在驱动系统中施加新引入的随机状态合同性来获得的，该系统在示例中很容易验证。可以通过非国家缩减的系统来满足随机状态的合同性，这通常是为了保证储层计算中的回声状态属性的需求。结果，即使不存在Echo State属性，也可能会得到满足。

translated by 谷歌翻译

Deep Learning Enabled Time-Lapse 3D Cell Analysis

Jiaxiang Jiang , Amil Khan , S. Shailja , Samuel A. Belteton , Michael Goebel , Daniel B. Szymanski , B. S. Manjunath

分类：计算机视觉

2022-08-17

本文提出了一种延时3D细胞分析的方法。具体而言，我们考虑了准确定位和定量分析亚细胞特征的问题，以及从延时3D共聚焦细胞图像堆栈跟踪单个细胞的问题。细胞的异质性和多维图像的体积提出了对细胞形态发生和发育的完全自动化分析的主要挑战。本文是由路面细胞生长过程和构建定量形态发生模型的动机。我们提出了一种基于深度特征的分割方法，以准确检测和标记每个细胞区域。基于邻接图的方法用于提取分段细胞的亚细胞特征。最后，提出了使用多个单元格特征的基于强大的图形跟踪算法在不同的时间实例中关联单元格。提供了广泛的实验结果，并证明了所提出的方法的鲁棒性。该代码可在GitHub上获得，该方法可通过Bisque Portal作为服务可用。

translated by 谷歌翻译

OMD: Orthogonal Malware Detection Using Audio, Image, and Static Features

Lakshmanan Nataraj , Tajuddin Manhar Mohammed , Tejaswi Nanjundaswamy , Satish Chikkagoudar , Shivkumar Chandrasekaran , B. S. Manjunath

分类：计算机视觉 | 机器学习

2021-11-08

通过越来越多的恶意软件和网络攻击，需要“正交”网络防御方法，其通过检测不被其他方法预测的唯一恶意软件样本来互补。在本文中，我们提出了一种新颖和正交的恶意软件检测（OMD）方法来使用音频描述符，图像相似性描述符和其他静态/统计特征的组合来识别恶意软件。首先，我们展示当恶意软件二进制文件表示为音频信号时，如何如何在分类恶意软件系列方面有效。然后，我们表明对音频描述符的预测与对图像相似性描述符和其他静态特征的预测正交。此外，我们开发了一个错误分析的框架和度量标准，以量化正交的新功能集（或类型）是关于其他特征集的方式。这允许我们为我们的整体框架添加新功能和检测方法。恶意软件数据集的实验结果表明，我们的方法为正交恶意软件检测提供了一种强大的框架。

translated by 谷歌翻译

HAPSSA: Holistic Approach to PDF Malware Detection Using Signal and Statistical Analysis

Tajuddin Manhar Mohammed , Lakshmanan Nataraj , Satish Chikkagoudar , Shivkumar Chandrasekaran , B. S. Manjunath

分类：机器学习

2021-11-08

恶意PDF文件对需要现代威胁情报平台的各种安全组织提出了严重威胁，以有效分析和表征PDF恶意软件的身份和行为。最先进的方法使用机器学习（ml）来学习特征PDF恶意软件的功能。然而，ML模型通常很容易受到逃避攻击的影响，其中对手会使恶意软件代码禁止以避免被防病毒检测到。在本文中，我们推出了一种简单而有效的整体方法，用于PDF恶意软件检测，利用了恶意软件二进制文件的信号和统计分析。这包括组合来自各种静态和动态恶意软件检测方法的正交特征空间模型，以便在面对代码混淆时启用广泛的鲁棒性。使用包含恶意软件和良性示例的近30,000个PDF文件的数据集，我们表明我们的整体方法维持了高检测率（99.92％）的PDF恶意软件，甚至可以检测通过简单方法创建的新的恶意文件，以删除恶意软件所进行的混淆作者隐藏他们的恶意软件，这些恶意软件被大多数杀毒失真。

translated by 谷歌翻译

Embedding Information onto a Dynamical System

G Manjunath

分类：机器学习

2021-05-22

庆祝的Takens嵌入定理界面涉及通过通用延迟观察地图在适当的维度的欧几里德空间中嵌入动态系统的吸引子。嵌入也建立了一种拓扑共轭。在本文中，我们展示了如何将任意序列映射到另一个空间中作为非自治动态系统的吸引力的解决方案。这种映射还需要拓扑缀合物和序列与吸引人的解决方案空间之间的嵌入。这一结果不是嵌入定理的Takens的概括，但有助于我们了解所广泛用于应用程序的离散时间状态模型所需的究竟是什么，以将外部刺激嵌入到其解决方案上。我们的成果解决了关于自主动态系统扰动的另一个基本问题。我们描述了当外源噪声渗透到连续的局部不可缩续的吸引装置（如稳定的固定点）的离散时间自主动态系统的局部不可缩短的局部噪声时究竟发生了什么。

translated by 谷歌翻译